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The bibliography appearing at the end of this article lists 37 sorting algorithms 
and 100 books and papers on sorting published in the last 20 years. The basic 
ideas presented here have been abstracted from this body of work, and the beat 
algorithms known are given as examples. As the algorithms are explained, 
references to related algorithms and mathematical or experimental analyses are 
given. Suggestions are then made for choosing the algorithm best suited to a 
given situation. 

Key words and phrases: sorting 

CR category: 5.31 



INTRODUCTION 

Sorting is used to put items in order. The 
sorting algorithms themselves are not diffi- 
cult to understand, but a comparison of the 
relative merits of the many algorithms does 
require some effort. In fact, the question of 
when an ordering is required is not a simple 
one: for example, a file that is best main- 
tained in sorted order when stored in mag- 
netic tape might be kept more efficiently on 
disk with a scatter storage technique. 
Whether or not a file should be sorted de- 
pends on how it is to be used, the extent to 
which the storage medium can be randomly 
accessed, and the statistics of the particular 
item of information on which the file might 
be sorted. Once the various sorting algo- 
rithms have been analyzed, one can see how 
these factors come into play. 

In data management applications it is 
customary to define a file as a collection of 
records, and a record as consisting of one or 
more information groups. Each information 
group may contain several items of informa- 
tion. Records within a file are often sorted, 

* The work herein was supported in part by Proj- 
ect MAC, an MIT research project sponsored by 
the Advanced Research Projects Agency, Depart- 
ment of Defense, under Office of Naval Research 
Contract N0OO14-7O-A-0362-00O1. 



and sometimes information groups and in- 
dividual items of information are sorted as 
well. Records are sorted by identifying a 
particular item of information in the record 
as the key; the records are then sorted into 
the ascending or descending order of their 
keys. Most sorting schemes involve moving 
the elements to be sorted from one place to 
another. The elements are generally moved 
several times before the final sorted order is 
achieved. Thus, when sorting records it may 
be better either to sort their keys first and 
then move the records into the final sorted 
order, or to use the sorted keys as an index 
to the records. Questions of this type will be 
considered in the latter sections of this 
paper, after presentation of the various 
sorting algorithms. 

No one sorting technique is best for every 
situation. The fastest methods are more 
difficult to program and are not considered 
worth the effort for a few short lists of 
numbers. Even if programming effort is not 
a consideration, the choice of method would 
depend on: the length of the list to be sorted; 
the relation between the length of the list 
and the number of cells in the main memory 
of the machine used for sorting; the number 
and size of any disk or tape units used in the 
sort; the extent to which the elements are 
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The Use of Information in Sorting 



H. LYNN BEUS 

General Electric Company* Schenectady, New York 

abstract. The information-gathering aspect of sorting is considered from a theoretical 
viewpoint. A large class, R, of sorting algorithms is defined, based on the idea of information 
use. Properties of this algorithm class are developed, and it is noted that several well-known 
sorting algorithms are closely related to algorithms in R t The Binary Tree Sort is shown to be 
in R and to have unique properties in this class. A vector is defined which characterizes the 
information-gathering efficiency of the algorithms of R. Finally, a more general class of algo- 
rithms is defined, and some of the definitions extended to this class. Two intriguing conjectures 
are given which appear to require graph theory or combinatorial topology for their solution. 

key words and phbasbs : sorting, sorting efficiency, sorting theory, sorting algorithms, infor- 
mation use, graph theory, combinatorial analysis, binary tree 
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1. Introduction and Basic Definitions 

In the last twenty years the introduction of digital computers has spurred interest 
in the problems of sorting or ordering sets of data. The stream of papers that began 
with Mauchly [8] and Goldstine and Von Neumann [5] have, in the main, presented 
results of individual algorithms for sorting, rather than general results concerning 
the overall sorting problem. There have been several exceptions to this, notable 
among which are [2, 3]. 

In this paper we restrict ourselves to the study of one aspect of the sorting process 
in an attempt to discover some unifying results which will apply to a great many 
algorithms. It has been pointed out that sorting can be considered as consisting of 
two intermixed processes: information gathering and ordering. In the first the items 
are compared in some way to gain information about their relative order. In the 
second the information is used to carry out the actual ordering of the items. It is 
the first of these processes that we study here in an attempt to obtain some meas- 
ure of the efficiency of various algorithms in information-gathering. 

In this discussion we call the set of items to be sorted a sort set. Herein such sets 
are arbitrary in general, but fixed for each particular application. We use x to 
denote such a standard sort set. Then x is a set of n items to be sorted, and if a, b 
are two items of x, ra and irb denote the values of their keys (on which the sorting 
is being done). We assume that items of x are numbered 1,2, • * * , n, and that the 
key values also are integers from the set ( 1, 2, *• * , n\ . I n denotes the set of integers 
{ 1, 2, 3, • • • ,»), and | G \ the cardinality of a set G. 

Given a sort set a; to be sorted by an algorithm A, there are n! possible sequences 
of key values associated with the items of x. We define the efficiency, wA, of the 

This paper is a revision of the author's Ph.D. thesis (Computing Center Rep. No. 1096) at 
Case Western Reserve University, Cleveland, Ohio, Nov. 1967. 
* Research and Development Center. 

Journal of the Association for Computing Machinery, Vol. 17. No. 3, July 1970. pp. 482-495. 
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